                         SEQUENCE LISTING

<110>  Universiteit Stellenbosch
       Den Haan, Riaan
       Van Zyl, Emile
       LaGrange, Danie
 
<120>  Heterologous Expression of Fungal Cellobiohydrolases in Yeast

<130>  2608.016PC01

<150>  US 61/052,213
<151>  2008-05-11

<160>  20

<170>  PatentIn version 3.3

<210>  1
<211>  1704
<212>  DNA
<213>  Talaromyces emersonii cbh1

<400>  1
ctcagactca aacactccat cagcagcttc gaaagcggtc tttttgctat catcatgctt     60

cgacgggctc ttcttctatc ctcttccgcc atccttgctg tcaaggcaca gcaggccggc    120

acggcgacgg cagagaacca cccgcccctg acatggcagg aatgcaccgc ccctgggagc    180

tgcaccaccc agaacggggc ggtcgttctt gatgcgaact ggcgttgggt gcacgatgtg    240

aacggataca ccaactgcta cacgggcaat acctgggacc ccacgtactg ccctgacgac    300

gaaacctgcg cccagaactg tgcgctggac ggcgcggatt acgagggcac ctacggcgtg    360

acttcgtcgg gcagctcctt gaaactcaat ttcgtcaccg ggtcgaacgt cggatcccgt    420

ctctacctgc tgcaggacga ctcgacctat cagatcttca agcttctgaa ccgcgagttc    480

agctttgacg tcgatgtctc caatcttccg tgcggattga acggcgctct gtactttgtc    540

gccatggacg ccgacggcgg cgtgtccaag tacccgaaca acaaggctgg tgccaagtac    600

ggaaccgggt attgcgactc ccaatgccca cgggacctca agttcatcga cggcgaggcc    660

aacgtcgagg gctggcagcc gtcttcgaac aacgccaaca ccggaattgg cgaccacggc    720

tcctgctgtg cggagatgga tgtctgggaa gcaaacagca tctccaatgc ggtcactccg    780

cacccgtgcg acacgccagg ccagacgatg tgctctggag atgactgcgg tggcacatac    840

tctaacgatc gctacgcggg aacctgcgat cctgacggct gtgacttcaa cccttaccgc    900

atgggcaaca cttctttcta cgggcctggc aagatcatcg ataccaccaa gcccttcact    960

gtcgtgacgc agttcctcac tgatgatggt acggatactg gaactctcag cgagatcaag   1020

cgcttctaca tccagaacag caacgtcatt ccgcagccca actcggacat cagtggcgtg   1080

accggcaact cgatcacgac ggagttctgc actgctcaga agcaggcctt tggcgacacg   1140

gacgacttct ctcagcacgg tggcctggcc aagatgggag cggccatgca gcagggtatg   1200

gtcctggtga tgagtttgtg ggacgactac gccgcgcaga tgctgtggtt ggattccgac   1260

tacccgacgg atgcggaccc cacgacccct ggtattgccc gtggaacgtg tccgacggac   1320

tcgggcgtcc catcggatgt cgagtcgcag agccccaact cctacgtgac ctactcgaac   1380

attaagtttg gtccgatcaa ctcgaccttc accgcttcgt gagtcttggt tacatttgaa   1440

gtagacggaa gtagctctgc gatggaactg gcatatggag aagaccacac aaaactgcat   1500

cgaagaaaag aggggggaaa agagaaaagc aaagttattt agtttgaaaa tgaaactacg   1560

ctcgttttta ttcttgaaaa tcgccactct tgcctttttt ttcttttttc tttttatttt   1620

ttttcctttt gaaatcttca atttaaatgt acatattgtt aaatcaaatc aagtaaatat   1680

acttgaaaaa aaaaaaaaaa aaaa                                          1704


<210>  2
<211>  2046
<212>  DNA
<213>  Humicola grisea cbh1

<400>  2
gccgtgacct tgcgcgcttt gggtggcggt ggcgagtcgt ggacggtgct tgctggtcgc     60

cggccttccc ggcgatccgc gtgatgagag ggccaccaac ggcgggatga tgctccatgg    120

ggaacttccc catggagaag agagagaaac ttgcggagcc gtgatctggg gaaagatgct    180

ccgtgtctcg tctatataac tcgagtctcc ccgagccctc aacaccacca gctctgatct    240

caccatcccc atcgacaatc acgcaaacac agcagttgtc gggccattcc ttcagacaca    300

tcagtcaccc tccttcaaaa tgcgtaccgc caagttcgcc accctcgccg cccttgtggc    360

ctcggccgcc gcccagcagg cgtgcagtct caccaccgag aggcaccctt ccctctcttg    420

gaacaagtgc accgccggcg gccagtgcca gaccgtccag gcttccatca ctctcgactc    480

caactggcgc tggactcacc aggtgtctgg ctccaccaac tgctacacgg gcaacaagtg    540

ggatactagc atctgcactg atgccaagtc gtgcgctcag aactgctgcg tcgatggtgc    600

cgactacacc agcacctatg gcatcaccac caacggtgat tccctgagcc tcaagttcgt    660

caccaagggc cagcactcga ccaacgtcgg ctcgcgtacc tacctgatgg acggcgagga    720

caagtatcag agtacgttct atcttcagcc ttctcgcgcc ttgaatcctg gctaacgttt    780

acacttcaca gccttcgagc tcctcggcaa cgagttcacc ttcgatgtcg atgtctccaa    840

catcggctgc ggtctcaacg gcgccctgta cttcgtctcc atggacgccg atggtggtct    900

cagccgctat cctggcaaca aggctggtgc caagtacggt accggctact gcgatgctca    960

gtgcccccgt gacatcaagt tcatcaacgg cgaggccaac attgagggct ggaccggctc   1020

caccaacgac cccaacgccg gcgcgggccg ctatggtacc tgctgctctg agatggatat   1080

ctgggaagcc aacaacatgg ctactgcctt cactcctcac ccttgcacca tcattggcca   1140

gagccgctgc gagggcgact cgtgcggtgg cacctacagc aacgagcgct acgccggcgt   1200

ctgcgacccc gatggctgcg acttcaactc gtaccgccag ggcaacaaga ccttctacgg   1260

caagggcatg accgtcgaca ccaccaagaa gatcactgtc gtcacccagt tcctcaagga   1320

tgccaacggc gatctcggcg agatcaagcg cttctacgtc caggatggca agatcatccc   1380

caactccgag tccaccatcc ccggcgtcga gggcaattcc atcacccagg actggtgcga   1440

ccgccagaag gttgcctttg gcgacattga cgacttcaac cgcaagggcg gcatgaagca   1500

gatgggcaag gccctcgccg gccccatggt cctggtcatg tccatctggg atgaccacgc   1560

ctccaacatg ctctggctcg actcgacctt ccctgtcgat gccgctggca agcccggcgc   1620

cgagcgcggt gcctgcccga ccacctcggg tgtccctgct gaggttgagg ccgaggcccc   1680

caacagcaac gtcgtcttct ccaacatccg cttcggcccc atcggctcga ccgttgctgg   1740

tctccccggc gcgggcaacg gcggcaacaa cggcggcaac cccccgcccc ccaccaccac   1800

cacctcctcg gctccggcca ccaccaccac cgccagcgct ggccccaagg ctggccgctg   1860

gcagcagtgc ggcggcatcg gcttcactgg cccgacccag tgcgaggagc cctacatttg   1920

caccaagctc aacgactggt actctcagtg cctgtaaatt ctgagtcgct gactcgacga   1980

tcacggccgg tttttgcatg aaaggaaaca aacgaccgcg ataaaaatgg agggtaatga   2040

gatgtc                                                              2046


<210>  3
<211>  3459
<212>  DNA
<213>  Thermoascus aurantiacus cbh1

<400>  3
gaattctaga cctttatcct ttcatccgac cagacttccc tttttgacct tggcgccctg     60

ttgactacct acctacctag gtagtaacgt cgtcgaccct cttgaatgat ccttgtcaca    120

ctgcaaacat ccgaaaacat acggcaaaag atgattgggc atggatgcag gagacatcga    180

atgagggctt agaaggaaat gaaaacctgg gaccaggacg ctaggtacga tgaaatccgc    240

caatggtgaa actttaagtc gtgcctacag cacaggctct gtgaagattg cgctgttcag    300

acttaatctt ctcatcacag tccaagtctt tatgaaaagg aaaaagagag ggaagagcgc    360

tatttcgagc tgttggcctc atagggagac agtcgagcat accagcggta tcgacgttag    420

actcaaccaa gaataatgac gagaataaac acagaagtca accttgaact ggatagcagg    480

gttccagcag cagatagtta cttgcataaa gacaactccc cgagggctct ctgcatacac    540

caggatgttc cggaattatt cactgctcgt ttccgacgtg gcgtcagtga tccgtctcca    600

cagaactcta cctgggaata acccagggga ggaatctgca agtaagaact taataccaat    660

ccccggggct gccgaggtga atcgaatctc ccgcgggaaa ttaaacccat acgatgtttt    720

tgcaccacat gcatgcttag cacgatttct ccgcaaggga gtcacagaga aagacatatt    780

tcgcatacta ctgtgactct gcagagttac atatcactca ggatacattg cagatcattg    840

tccgggcatc aaaaatggac ctgcaggatc aacggcccga caaaacacaa gtggctaaag    900

ctgggggatg cccgaaaccc tctggtgcaa tatcatttga tggatgttcc ccccgcattt    960

ctaagacatc gacggatcgg cccgcatact aatcctttta tcaaccaaaa gttccactcg   1020

actagagaaa aaaaaggcca aggccactag ttgcagtcgg atactggtct tttcgccgtc   1080

caacaccttc atccatgatc cccttagcca ccaatgcccc acataataca tgttgacata   1140

ggtacgtagc tctgttatcc aatcggatcc gaacctcttt aacggacccc tcctacacac   1200

cttatcctaa cttcagaaga ctgttgccca ttggggattg aggaggtccg ggtcgcagga   1260

tgcgttctag gctaaattct cggccggtag ccatctcgaa tctctcgtga agccttcatc   1320

tgaacggttg gcggcccgtc aagccgatga ccatgggttc ctgatagagc ttgtgcctga   1380

ccggccttgg cggcatagac gagctgaaca catcaggtat gaacagatca gatataaagt   1440

cggattgagt cctagtacga agcaatccgc caccaccaaa tcaagcaacg agcgacacga   1500

ataacaatat caatcgaatc gcaatgtatc agcgcgctct tctcttctct ttcttcctcg   1560

ccgccgcccg cgcgcacgag gccggtaccg taaccgcaga gaatcaccct tccctgacct   1620

ggcagcaatg ctccagcggc ggtagttgta ccacgcagaa tggaaaagtc gttatcgatg   1680

cgaactggcg ttgggtccat accacctctg gatacaccaa ctgctacacg ggcaatacgt   1740

gggacaccag tatctgtccc gacgacgtga cctgcgctca gaattgtgcc ttggatggag   1800

cggattacag tggcacctat ggtgttacga ccagtggcaa cgccctgaga ctgaactttg   1860

tcacccaaag ctcagggaag aacattggct cgcgcctgta cctgctgcag gacgacacca   1920

cttatcagat cttcaagctg ctgggtcagg agtttacctt cgatgtcgac gtctccaatc   1980

tcccttgcgg gctgaacggc gccctctact ttgtggccat ggacgccgac ggcaatttgt   2040

ccaaataccc tggcaacaag gcaggcgcta agtatggcac tggttactgc gactctcagt   2100

gccctcggga tctcaagttc atcaacggtc aggtacgtca gaagtgataa ctagccagca   2160

gagcccatga atcattaact aacgctgtca aatacaggcc aacgttgaag gctggcagcc   2220

gtctgccaac gacccaaatg ccggcgttgg taaccacggt tcctcgtgcg ctgagatgga   2280

tgtctgggaa gccaacagca tctctactgc ggtgacgcct cacccatgcg acacccccgg   2340

ccagaccatg tgccagggag acgactgtgg tggaacctac tcctccactc gatatgctgg   2400

tacctgcgac cctgatggct gcgacttcaa tccttaccag ccaggcaacc actcgttcta   2460

cggccccggg aagatcgtcg acactagctc caaattcacc gtcgtcaccc agttcatcac   2520

cgacgacggg acaccctccg gcaccctgac ggagatcaaa cgcttctacg tccagaacgg   2580

caaggtgatc ccccagtcgg agtcgacgat cagcggcgtc accggcaact caatcaccac   2640

cgagtattgc acggcccaga aggcagcctt cggcgacaac accggcttct tcacgcacgg   2700

cgggcttcag aagatcagtc aggctctggc tcagggcatg gtcctcgtca tgagcctgtg   2760

ggacgatcac gccgccaaca tgctctggct ggacagcacc tacccgactg atgcggaccc   2820

ggacacccct ggcgtcgcgc gcggtacctg ccccacgacc tccggcgtcc cggccgacgt   2880

tgagtcgcag aaccccaatt catatgttat ctactccaac atcaaggtcg gacccatcaa   2940

ctcgaccttc accgccaact aagtaagtaa cgggcactct accaccgaga gcttcgtgaa   3000

gatacagggg tagttgggag attgtcgtgt acaggggaca tgcgatgctc aaaaatctac   3060

atcagtttgc caattgaacc atgaagaaaa gggggagatc aaagaagtct gtcagaagag   3120

aggggctgtg gcagcttaag ccttgttgta gatcgttcag agaaaaaaaa agtttgcgta   3180

cttattatat taggtcgatc attatccgat tgactccgtg acaagaatta aaaagagtac   3240

tgcttgcttg cctatttaaa ttgttatata cgccgtagcg cttgcggacc acccctcaca   3300

gtatatcggt tcgcctcttc ttgtctcttc atctcacatc acaggtccag gtccagcccg   3360

gcccggtccg ggtgccatgc atgcacaggg ggactaatat attaatcgtg accctgtvcc   3420

taagctaggg tccctgcatt ttgaacctgt ggacgtctg                          3459


<210>  4
<211>  2220
<212>  DNA
<213>  Trichoderma reesei cbh1

<400>  4
aaggttagcc aagaacaata gccgataaag atagcctcat taaacggaat gagctagtag     60

gcaaagtcag cgaatgtgta tatataaagg ttcgaggtcc gtgcctccct catgctctcc    120

ccatctactc atcaactcag atcctccagg agacttgtac accatctttt gaggcacaga    180

aacccaatag tcaaccgcgg actggcatca tgtatcggaa gttggccgtc atcacggcct    240

tcttggccac agctcgtgct cagtcggcct gcactctcca atcggagact cacccgcctc    300

tgacatggca gaaatgctcg tctggtggca cttgcactca acagacaggc tccgtggtca    360

tcgacgccaa ctggcgctgg actcacgcta cgaacagcag cacgaactgc tacgatggca    420

acacttggag ctcgacccta tgtcctgaca acgagacctg cgcgaagaac tgctgtctgg    480

acggtgccgc ctacgcgtcc acgtacggag ttaccacgag cggtaacagc ctctccattg    540

gctttgtcac ccagtctgcg cagaagaacg ttggcgctcg cctttacctt atggcgagcg    600

acacgaccta ccaggaattc accctgcttg gcaacgagtt ctctttcgat gttgatgttt    660

cgcagctgcc gtaagtgact taccatgaac ccctgacgta tcttcttgtg ggctcccagc    720

tgactggcca atttaaggtg cggcttgaac ggagctctct acttcgtgtc catggacgcg    780

gatggtggcg tgagcaagta tcccaccaac aacgctggcg ccaagtacgg cacggggtac    840

tgtgacagcc agtgtccccg cgatctgaag ttcatcaatg gccaggccaa cgttgagggc    900

tgggagccgt catccaacaa cgcaaacacg ggcattggag gacacggaag ctgctgctct    960

gagatggata tctgggaggc caactccatc tccgaggctc ttacccccca cccttgcacg   1020

actgtcggcc aggagatctg cgagggtgat gggtgcggcg gaacttactc cgataacaga   1080

tatggcggca cttgcgatcc cgatggctgc gactggaacc cataccgcct gggcaacacc   1140

agcttctacg gccctggctc aagctttacc ctcgatacca ccaagaaatt gaccgttgtc   1200

acccagttcg agacgtcggg tgccatcaac cgatactatg tccagaatgg cgtcactttc   1260

cagcagccca acgccgagct tggtagttac tctggcaacg agctcaacga tgattactgc   1320

acagctgagg agacagaatt cggcggatct ctttctcaga caagggcggc ctgactcagt   1380

tcaagaaggc tacctctggc ggcatggttc tggtcatgag tctgtgggat gatgtgagtt   1440

tgatggacaa acatgcgcgt tgacaaagag tcaagcagct gactgagatg ttacagtact   1500

acgccaacat gctgtggctg gactccacct acccgacaaa cgagacctcc tccacacccg   1560

gtgccgtgcg cggaagctgc tccaccagct ccggtgtccc tgctcaggtc gaatctcagt   1620

ctcccaacgc caaggtcacc ttctccaaca tcaagttcgg acccattggc agcaccggca   1680

accctagcgg cggcaaccct cccggcggaa accgtggcac caccaccacc cgccgcccag   1740

ccactaccac tggaagctct cccggaccta cccagtctca ctacggccag tgcggcggta   1800

ttggctacag cggccccacg gtctgcgcca gcggcacaac ttgccaggtc ctgaaccctt   1860

actactctca gtgcctgtaa agctccgtgc gaaagcctga cgcaccggta gattcttggt   1920

gagcccgtat catgacggcg gcgggagcta catggccccg ggtgatttat tttttttgta   1980

tctacttctg acccttttca aatatacggt caactcatct ttcactggag atgcggcctg   2040

cttggtattg cgatgttgtc agcttggcaa attgtggctt tcgaaaacac aaaacgattc   2100

cttagtagcc atgcatttta agataacgga atagaagaaa gaggaaatta aaaaaaaaaa   2160

aaaaacaaac atcccgttca taacccgtag aatcgccgct cttcgtgtat cccagtacca   2220


<210>  5
<211>  2369
<212>  DNA
<213>  Talaromyces emersonii cbh2


<220>
<221>  misc_feature
<222>  (298)..(298)
<223>  n is a, c, g, or t

<400>  5
gacggacctg cacttagtcg gtaggttatg tatgtagctg gagattggga tagggaagtt     60

agctaatagt ctacttcgtg tgagggttga ttttgatggt cgacagtatt cgtttcttat    120

acgcagcgtc atggatctgt gtttctgtca catgtcgggt ggatggttcc tggacagcag    180

cacacaaatg gtgttctgta gataggcgat actcggcagg ggattgtgca ggggattgta    240

tcgtagatgg ttctagtaaa atagatcccg agtatggtta gctctcatac ctcgagtnga    300

tgaagcacaa tatgctacga tatgccaagt aaaactctat tgtattctgc agctagcaat    360

tgaagaatcc gacattccca ttgtcatcta atcgggcaga catgtgcaaa gagggacgat    420

tcgtgatcga agtgctccaa tccatggcgt aggaccagac agctccatcc gatctagagc    480

tatatggagc tcctcgcaac tccgacactc cgcgagacag ctctcacaag cactataaat    540

atggccaaga accctgcaga acagcttcac tctacagccc gttgagcaga acaaacaaaa    600

tatcactcca gagagaaagc aacatgcgga atcttcttgc tcttgcaccg gccgcgctgc    660

ttgtcggcgc agcggaagcg caacaatccc tctggggaca atgtgagcag ctcctaaacg    720

tctgtctgag ggattatgtc tgactgctca ggcggcggga gttcgtggac tggcgcgacg    780

agctgtgctg ctggagcgac gtgcagcaca atcaatcctt gtacgtctgc tgaacgataa    840

tcctacattg ttgacgtgct aactgcgtag actacgcaca atgcgttcct gcaacggcca    900

ctccgaccac gctgacgaca acgacaaaac caacgtccac cggcggcgct gctccaacga    960

ctcctcctcc gacaacgact ggaacaacga catcgcccgt cgtcaccagg cccgcgtctg   1020

cctccggcaa cccgttcgaa ggctaccagc tctacgccaa tccgtactat gcgtcggagg   1080

tgattagttt ggcaattccc tcgctgagca gcgagctggt tcccaaggcg agcgaggtgg   1140

ccaaggtgcc gtctttcgtc tggctgtaag taaattcccc caggctgtca tttcccctta   1200

ctgatcttgt ccagcgacca agccgccaag gtgcccagca tgggcgacta tctgaaagac   1260

atccagtcgc agaacgcagc cggcgcagac cccccgattg caggcatctt tgtcgtctac   1320

gacctgcctg accgcgactg cgcggctgca gccagcaatg gcgagttctc catcgccaac   1380

aacggcgtcg ccctgtacaa gcagtacatc gactcgatcc gcgagcagct gacgacctat   1440

tcagatgtgc acaccatcct ggtcatcggt agttccagtc ctcttctgtg atgttgatga   1500

aaaaaatact gactgactcc tgcagaaccc gacagccttg cgaacgtggt caccaacctg   1560

aacgtgccga aatgcgcaaa tgcccaggac gcctatctcg aatgcatcaa ctacgccatc   1620

acccagctcg atctgccaaa cgtggccatg tatcttgatg ctggtgagtc ctcacataca   1680

agtgaataaa aataaaactg atgcagtgca ggacacgccg gatggctagg ctggcaagcc   1740

aacctcgccc ccgccgccca gctgtttgcc tcggtgtaca aaaacgcctc ctctccggca   1800

tccgtccgcg gtctcgccac caacgtcgcc aactacaacg cctggtcgat cagccggtgc   1860

ccgtcgtaca cgcagggcga cgccaattgc gacgaggagg attacgtgaa tgccttgggg   1920

ccgttgttcc aggaacaggg attcccggca tattttatca ttgatacatg taagctttac   1980

cccagaaccc ctccatagaa ggtcaatcta acggtaatgt acagcccgca atggcgtccg   2040

acccaccaag caaagccaat ggggcgactg gtgcaacgtc atcggcacgg gcttcggcgt   2100

ccggcccacg accgacaccg gcaatcctct cgaggacgct ttcgtctggg tcaagcccgg   2160

tggcgagagc gatggcacgt ccaacacgac ctctccgcgg tacgactacc actgcgggct   2220

gagcgatgcg ctgcagccgg cgccggaggc ggggacttgg ttccaggtat gacgcgcctt   2280

cgtattagca attacgatac atgtgcatgc tgaccatgcg acaggcgtac tttgagcagt   2340

tgctcacgaa tgctaacccg ctgttctga                                     2369


<210>  6
<211>  2193
<212>  DNA
<213>  Trichoderma reesei cbh2

<400>  6
tcgaactgac aagttgttat attgcctgtg taccaagcgc gaatgtggac aggattaatg     60

ccagagttca ttagcctcaa gtagagccta tttcctcgcc ggaaagtcat ctctcttatt    120

gcatttctgc ccttcccact aactcagggt gcagcgcaac actacacgca acatatacac    180

tttattagcc gtgcaacaag gctattctac gaaaaatgct acactccaca tgttaaaggc    240

gcattcaacc agcttcttta ttgggtaata tacagccagg cggggatgaa gctcattagc    300

cgccactcaa ggctatacaa tgttgccaac tctccgggct ttatcctgtg ctcccgaata    360

ccacatcgtg atgatgcttc agcgcacgga agtcacagac accgcctgta taaaaggggg    420

actgtgaccc tgtatgaggc gcaacatggt ctcacagcag ctcacctgaa gaggcttgta    480

agatcaccct ctgtgtattg caccatgatt gtcggcattc tcaccacgct ggctacgctg    540

gccacactcg cagctagtgt gcctctagag gagcggcaag cttgctcaag cgtctggtaa    600

ttatgtgaac cctctcaaga gacccaaata ctgagatatg tcaaggggcc aatgtggtgg    660

ccagaattgg tcgggtccga cttgctgtgc ttccggaagc acatgcgtct actccaacga    720

ctattactcc cagtgtcttc ccggcgctgc aagctcaagc tcgtccacgc gcgccgcgtc    780

gacgacttct cgagtatccc ccacaacatc ccggtcgagc tccgcgacgc ctccacctgg    840

ttctactact accagagtac ctccagtcgg atcgggaacc gctacgtatt caggcaaccc    900

ttttgttggg gtcactcctt gggccaatgc atattacgcc tctgaagtta gcagcctcgc    960

tattcctagc ttgactggag ccatggccac tgctgcagca gctgtcgcaa aggttccctc   1020

ttttatgtgg ctgtaggtcc tcccggaacc aaggcaatct gttactgaag gctcatcatt   1080

cactgcagag atactcttga caagacccct ctcatggagc aaaccttggc cgacatccgc   1140

accgccaaca agaatggcgg taactatgcc ggacagtttg tggtgtatga cttgccggat   1200

cgcgattgcg ctgcccttgc ctcgaatggc gaatactcta ttgccgatgg tggcgtcgcc   1260

aaatataaga actatatcga caccattcgt caaattgtcg tggaatattc cgatatccgg   1320

accctcctgg ttattggtga gtttaaacac ctgcctcccc ccccccttcc cttcctttcc   1380

cgccggcatc ttgtcgttgt gctaactatt gttccctctt ccagagcctg actctcttgc   1440

caacctggtg accaacctcg gtactccaaa gtgtgccaat gctcagtcag cctaccttga   1500

gtgcatcaac tacgccgtca cacagctgaa ccttccaaat gttgcgatgt atttggacgc   1560

tggccatgca ggatggcttg gctggccggc aaaccaagac ccggccgctc agctatttgc   1620

aaatgtttac aagaatgcat cgtctccgag agctcttcgc ggattggcaa ccaatgtcgc   1680

caactacaac gggtggaaca ttaccagccc cccatcgtac acgcaaggca acgctgtcta   1740

caacgagaag ctgtacatcc acgctattgg acctcttctt gccaatcacg gctggtccaa   1800

cgccttcttc atcactgatc aaggtcgatc gggaaagcag cctaccggac agcaacagtg   1860

gggagactgg tgcaatgtga tcggcaccgg atttggtatt cgcccatccg caaacactgg   1920

ggactcgttg ctggattcgt ttgtctgggt caagccaggc ggcgagtgtg acggcaccag   1980

cgacagcagt gcgccacgat ttgactccca ctgtgcgctc ccagatgcct tgcaaccggc   2040

gcctcaagct ggtgcttggt tccaagccta ctttgtgcag cttctcacaa acgcaaaccc   2100

atcgttcctg taaggctttc gtgaccgggc ttcaaacaat gatgtgcgat ggtgtggttc   2160

ccggttggcg gagtctttgt ctactttggt tgt                                2193


<210>  7
<211>  1590
<212>  DNA
<213>  Humicola grisea cbh1

<400>  7
gaattcatga gaaccgctaa gttcgctacc ttggctgcct tggttgcctc tgctgctgct     60

caacaagcct gttccttgac tactgaacgt cacccatctt tgtcttggaa caagtgtact    120

gctggtggtc aatgtcaaac tgtccaagcc tccatcactt tggactctaa ttggagatgg    180

acccaccaag tctctggtag tactaactgt tacaccggta ataagtggga cacttctatt    240

tgtactgacg ctaagtcttg tgctcaaaat tgttgtgttg atggtgctga ttacacctcc    300

acttatggta ttaccaccaa cggtgactct ttgtccttga agttcgttac taaaggtcaa    360

cattccacca acgtcggttc tagaacctac ttaatggacg gtgaagacaa gtaccaaacc    420

ttcgaattgt tgggtaatga atttaccttc gatgtcgatg tgtctaacat cggttgtggt    480

ttgaacggtg ctttatactt cgtttctatg gacgccgacg gtggtttgtc tcgttaccca    540

ggtaataagg ctggtgccaa gtatggtacc ggttactgtg atgctcaatg cccaagagac    600

attaagttca tcaacggtga agctaacatt gaaggttgga ctggttctac caacgaccca    660

aacgctggcg ccggtagata cggtacctgt tgttccgaaa tggacatttg ggaagccaac    720

aacatggcta ctgcttttac tccacaccca tgtaccatca ttggtcaatc cagatgtgaa    780

ggtgactcct gtggcggtac ctactccaac gaaagatacg ctggtgtttg tgatccagac    840

ggttgtgact tcaactccta cagacaaggt aacaagactt tctatggtaa gggtatgact    900

gtcgatacca ccaagaagat caccgtcgtc acccaattct tgaaggacgc taacggtgat    960

ttaggtgaaa ttaaaagatt ctacgtccaa gatggtaaga tcatcccaaa ctctgaatct   1020

accattccag gtgttgaagg taattccatc actcaagact ggtgtgacag acaaaaggtt   1080

gccttcggtg atattgacga cttcaacaga aagggtggta tgaagcaaat gggtaaggct   1140

ttggccggtc caatggtctt ggttatgtct atttgggacg atcacgcttc caacatgttg   1200

tggttggact ccaccttccc agttgatgct gctggtaagc caggtgccga aagaggtgct   1260

tgtccaacta cttccggtgt cccagctgaa gttgaagccg aagctccaaa ttctaacgtt   1320

gtcttctcta acatcagatt cggtccaatc ggttccacag tcgctggttt gccaggtgct   1380

ggtaatggtg gtaataacgg tggtaaccca ccaccaccaa ccactaccac ttcttctgcc   1440

ccagctacta ccaccaccgc ttctgctggt ccaaaggctg gtagatggca acaatgtggt   1500

ggtattggtt tcaccggtcc aacccaatgt gaagaaccat acatctgtac caagttgaac   1560

gactggtact ctcaatgttt ataactcgag                                    1590


<210>  8
<211>  1383
<212>  DNA
<213>  Thermoascus aurantiacus cbh1

<400>  8
gaattcatgt accaaagagc tctattgttc tccttcttct tggccgccgc tagagctcat     60

gaagccggta ctgtcaccgc cgaaaaccac ccatccttga cttggcaaca atgttcctct    120

ggtggttctt gtactactca aaacgggaag gttgttattg acgctaactg gagatgggtt    180

cacactacct ccggttacac caactgttac actggtaaca cttgggatac ttccatctgt    240

ccagacgacg ttacctgtgc tcaaaactgt gctttggacg gtgctgacta ctccggtact    300

tacggtgtca ctacctctgg caacgcgttg agattgaact tcgtcaccca atcttctggt    360

aagaacatcg gttctagatt gtacttgttg caagacgata ctacttacca aatcttcaag    420

ttgttgggtc aagagttcac tttcgacgtt gatgtttcca acttgccttg tggtttgaac    480

ggtgctttgt acttcgttgc tatggacgcc gacggtaact tatccaagta cccaggtaac    540

aaggccggtg ccaagtacgg taccggttac tgtgattctc aatgtccaag agacctaaaa    600

ttcattaacg gtcaagctaa cgtcgaaggt tggcaaccat ctgctaacga tccaaacgcc    660

ggtgtcggta atcacggttc ctcctgtgct gaaatggacg tttgggaagc taactctatc    720

tccaccgccg tcactccaca tccatgtgat accccaggtc aaaccatgtg tcaaggtgat    780

gattgtggtg gtacctactc ttccactaga tacgctggta cctgtgacac cgacggttgt    840

gatttcaacc cataccaacc aggtaaccac tctttctacg gtccaggtaa gattgtcgat    900

acttcttcta agttcactgt tgtcactcaa ttcattaccg acgatggtac cccatctggt    960

accctaactg aaattaagag attctacgtc caaaacggta aagtcattcc acaatccgaa   1020

agcaccattt ccggtgttac cggtaactcc atcaccactg aatactgtac cgctcaaaag   1080

gccgcctttg acaacaccgg tttcttcacc catggtggtt tgcaaaagat ttctcaagcc   1140

ttggctcaag gtatggtttt ggtcatgtcc ttgtgggatg accacgctgc taacatgttg   1200

tggttggatt ctacttaccc aactgacgct gatccagaca ccccaggtgt tgctagaggt   1260

acttgtccaa ccacttctgg tgttccagct gacgtcgaat ctcaaaaccc taactcttac   1320

gttatctact ctaacatcaa ggtgggtcca attaactcca ccttcactgc taactaactc   1380

gag                                                                 1383


<210>  9
<211>  1379
<212>  DNA
<213>  Talaromyces emersonii cbh1

<400>  9
gaattcatgc taagaagagc tttactattg agctcttctg ctatcttggc cgttaaggct     60

caacaagccg gtaccgctac tgctgaaaac caccctccat tgacctggca agaatgtacc    120

gctccaggtt cttgtaccac ccaaaacggt gctgtcgtct tggacgctaa ctggagatgg    180

gtccacgacg tcaacggtta cactaactgt tacaccggta acacctggga cccaacttac    240

tgtccagacg acgaaacttg cgctcaaaac tgtgccttgg acggtgctga ctacgaaggt    300

acttacggtg ttacctcctc tggttcttcc ttgaagttga acttcgtcac tggttctaac    360

gtcggttcca gattgtattt gttgcaagat gactccactt accaaatctt caagttgttg    420

aacagagaat tttctttcga cgtcgatgtg tccaacttgc cttgtggttt gaacggtgct    480

ctatacttcg ttgctatgga cgctgatggt ggtgtttcca agtacccaaa caacaaggct    540

ggtgccaaat acggtactgg ttactgtgac tctcaatgtc cacgtgactt gaagtttatt    600

gatggtgaag ctaatgtcga aggttggcaa ccatcttcta acaacgctaa cactggcatc    660

ggtgaccacg gttcttgctg tgccgaaatg gacgtttggg aagccaactc catttccaac    720

gccgtcactc cacacccatg tgacactcca ggtcaaacta tgtgttccgg cgatgactgt    780

ggtggtactt actctaacga tagatacgct ggtacctgtg atccagacgg ttgcgacttc    840

aatccataca gaatgggtaa cacttccttt tacggtccag gcaagatcat cgacactact    900

aagccattca ctgttgtcac ccaattcttg accgacgatg gtactgatac cggtactttg    960

tccgaaatca agagattcta catccaaaac tctaacgtca tcccacaacc aaattccgac   1020

atctctggtg tcactggtaa ctccattacc accgaatttt gtaccgccca aaagcaagct   1080

ttcggtgaca ccgacgactt ctctcaacac ggtggtttgg ctaagatggg tgctgctatg   1140

caacaaggta tggttttggt catgtctttg tgggacgact acgctgctca aatgttgtgg   1200

ttggactccg attacccaac cgatgccgac ccaaccaccc ctggtatcgc tagaggtacc   1260

tgtccaactg actctggtgt tccatctgac gtcgaatccc aatctccaaa ctcctacgtc   1320

acttactcca acattaaatt ggtccaatca actccacttt cactgcttct taactcgag    1379


<210>  10
<211>  1392
<212>  DNA
<213>  Talaromyces emersonii cbh2

<400>  10
gaattcatgc gtaacttgtt ggccttggct ccagccgctt tgttggttgg tgctgccgaa     60

gctcaacaat ccttgtgggg tcaatgcggt ggttcctcct ggactggtgc aacttcctgt    120

gccgctggtg ccacctgttc caccattaac ccatactacg ctcaatgtgt tccagccact    180

gccactccaa ctaccttgac taccaccact aagccaacct ccaccggtgg tgctgctcca    240

accactccac caccaactac taccggtact accacctctc cagtcgtcac cagacctgcc    300

tccgcctccg gtaatccatt cgaaggttat caattgtacg ctaaccctta ctacgcttct    360

gaagtcattt ccttggctat cccatctttg agctccgagt tggtcccaaa ggcctccgaa    420

gttgctaagg tcccttcatt tgtctggtta gatcaagctg ccaaggttcc atctatgggt    480

gattacttga aggatattca atctcaaaac gctgctggtg ctgatccacc aatcgccggt    540

attttcgttg tttacgattt gccagataga gactgtgccg ccgctgcttc taacggtgaa    600

ttttctatcg ccaacaacgg tgtcgcttta tacaaacaat atatcgattc cattagagaa    660

caattaacca cttactccga cgtccatacc atcttggtta tcgaaccaga ctctttggct    720

aacgttgtca ctaacttgaa cgttccaaaa tgtgctaacg ctcaagatgc ttacttggaa    780

tgtatcaact acgctattac ccaattggac ttgccaaacg ttgctatgta cttggacgct    840

ggtcacgccg gttggttggg ttggcaagcc aacttggccc cagctgctca attattcgct    900

tctgtttaca agaacgcctc ttccccagcc tctgttagag gtttggctac caacgtggct    960

aactacaacg cctggtccat ttctagatgt ccatcctaca ctcaaggtga cgctaactgt   1020

gatgaagaag attacgttaa cgctttgggt ccattgttcc aagaacaagg tttcccagct   1080

tacttcatca tcgacacttc ccgtaacggt gtcagaccaa ctaagcaatc tcaatggggt   1140

gactggtgta acgttattgg taccggtttc ggtgttagac caaccaccga cactggtaac   1200

ccattggaag acgctttcgt ttgggtcaag ccaggtggtg aatccgacgg tacctccaac   1260

actactagcc cacgttacga ttaccactgt ggtttgtctg acgctttgca accagctcca   1320

gaagctggta cctggttcca agcctacttc gaacaattgt tgactaacgc caacccattg   1380

ttctaactcg ag                                                       1392


<210>  11
<211>  525
<212>  PRT
<213>  Humicola grisea cbh1

<400>  11

Met Arg Thr Ala Lys Phe Ala Thr Leu Ala Ala Leu Val Ala Ser Ala 
1               5                   10                  15      


Ala Ala Gln Gln Ala Cys Ser Leu Thr Thr Glu Arg His Pro Ser Leu 
            20                  25                  30          


Ser Trp Asn Lys Cys Thr Ala Gly Gly Gln Cys Gln Thr Val Gln Ala 
        35                  40                  45              


Ser Ile Thr Leu Asp Ser Asn Trp Arg Trp Thr His Gln Val Ser Gly 
    50                  55                  60                  


Ser Thr Asn Cys Tyr Thr Gly Asn Lys Trp Asp Thr Ser Ile Cys Thr 
65                  70                  75                  80  


Asp Ala Lys Ser Cys Ala Gln Asn Cys Cys Val Asp Gly Ala Asp Tyr 
                85                  90                  95      


Thr Ser Thr Tyr Gly Ile Thr Thr Asn Gly Asp Ser Leu Ser Leu Lys 
            100                 105                 110         


Phe Val Thr Lys Gly Gln His Ser Thr Asn Val Gly Ser Arg Thr Tyr 
        115                 120                 125             


Leu Met Asp Gly Glu Asp Lys Tyr Gln Thr Phe Glu Leu Leu Gly Asn 
    130                 135                 140                 


Glu Phe Thr Phe Asp Val Asp Val Ser Asn Ile Gly Cys Gly Leu Asn 
145                 150                 155                 160 


Gly Ala Leu Tyr Phe Val Ser Met Asp Ala Asp Gly Gly Leu Ser Arg 
                165                 170                 175     


Tyr Pro Gly Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp 
            180                 185                 190         


Ala Gln Cys Pro Arg Asp Ile Lys Phe Ile Asn Gly Glu Ala Asn Ile 
        195                 200                 205             


Glu Gly Trp Thr Gly Ser Thr Asn Asp Pro Asn Ala Gly Ala Gly Arg 
    210                 215                 220                 


Tyr Gly Thr Cys Cys Ser Glu Met Asp Ile Trp Glu Ala Asn Asn Met 
225                 230                 235                 240 


Ala Thr Ala Phe Thr Pro His Pro Cys Thr Ile Ile Gly Gln Ser Arg 
                245                 250                 255     


Cys Glu Gly Asp Ser Cys Gly Gly Thr Tyr Ser Asn Glu Arg Tyr Ala 
            260                 265                 270         


Gly Val Cys Asp Pro Asp Gly Cys Asp Phe Asn Ser Tyr Arg Gln Gly 
        275                 280                 285             


Asn Lys Thr Phe Tyr Gly Lys Gly Met Thr Val Asp Thr Thr Lys Lys 
    290                 295                 300                 


Ile Thr Val Val Thr Gln Phe Leu Lys Asp Ala Asn Gly Asp Leu Gly 
305                 310                 315                 320 


Glu Ile Lys Arg Phe Tyr Val Gln Asp Gly Lys Ile Ile Pro Asn Ser 
                325                 330                 335     


Glu Ser Thr Ile Pro Gly Val Glu Gly Asn Ser Ile Thr Gln Asp Trp 
            340                 345                 350         


Cys Asp Arg Gln Lys Val Ala Phe Gly Asp Ile Asp Asp Phe Asn Arg 
        355                 360                 365             


Lys Gly Gly Met Lys Gln Met Gly Lys Ala Leu Ala Gly Pro Met Val 
    370                 375                 380                 


Leu Val Met Ser Ile Trp Asp Asp His Ala Ser Asn Met Leu Trp Leu 
385                 390                 395                 400 


Asp Ser Thr Phe Pro Val Asp Ala Ala Gly Lys Pro Gly Ala Glu Arg 
                405                 410                 415     


Gly Ala Cys Pro Thr Thr Ser Gly Val Pro Ala Glu Val Glu Ala Glu 
            420                 425                 430         


Ala Pro Asn Ser Asn Val Val Phe Ser Asn Ile Arg Phe Gly Pro Ile 
        435                 440                 445             


Gly Ser Thr Val Ala Gly Leu Pro Gly Ala Gly Asn Gly Gly Asn Asn 
    450                 455                 460                 


Gly Gly Asn Pro Pro Pro Pro Thr Thr Thr Thr Ser Ser Ala Pro Ala 
465                 470                 475                 480 


Thr Thr Thr Thr Ala Ser Ala Gly Pro Lys Ala Gly Arg Trp Gln Gln 
                485                 490                 495     


Cys Gly Gly Ile Gly Phe Thr Gly Pro Thr Gln Cys Glu Glu Pro Tyr 
            500                 505                 510         


Ile Cys Thr Lys Leu Asn Asp Trp Tyr Ser Gln Cys Leu 
        515                 520                 525 


<210>  12
<211>  456
<212>  PRT
<213>  Thermoascus aurantiacus cbh1

<400>  12

Met Tyr Gln Arg Ala Leu Leu Phe Ser Phe Phe Leu Ala Ala Ala Arg 
1               5                   10                  15      


Ala His Glu Ala Gly Thr Val Thr Ala Glu Asn His Pro Ser Leu Thr 
            20                  25                  30          


Trp Gln Gln Cys Ser Ser Gly Gly Ser Cys Thr Thr Gln Asn Gly Lys 
        35                  40                  45              


Val Val Ile Asp Ala Asn Trp Arg Trp Val His Thr Thr Ser Gly Tyr 
    50                  55                  60                  


Thr Asn Cys Tyr Thr Gly Asn Thr Trp Asp Thr Ser Ile Cys Pro Asp 
65                  70                  75                  80  


Asp Val Thr Cys Ala Gln Asn Cys Ala Leu Asp Gly Ala Asp Tyr Ser 
                85                  90                  95      


Gly Thr Tyr Gly Val Thr Thr Ser Gly Asn Ala Leu Arg Leu Asn Phe 
            100                 105                 110         


Val Thr Gln Ser Ser Gly Lys Asn Ile Gly Ser Arg Leu Tyr Leu Leu 
        115                 120                 125             


Gln Asp Asp Thr Thr Tyr Gln Ile Phe Lys Leu Leu Gly Gln Glu Phe 
    130                 135                 140                 


Thr Phe Asp Val Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala 
145                 150                 155                 160 


Leu Tyr Phe Val Ala Met Asp Ala Asp Gly Asn Leu Ser Lys Tyr Pro 
                165                 170                 175     


Gly Asn Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln 
            180                 185                 190         


Cys Pro Arg Asp Leu Lys Phe Ile Asn Gly Gln Ala Asn Val Glu Gly 
        195                 200                 205             


Trp Gln Pro Ser Ala Asn Asp Pro Asn Ala Gly Val Gly Asn His Gly 
    210                 215                 220                 


Ser Ser Cys Ala Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Thr 
225                 230                 235                 240 


Ala Val Thr Pro His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Gln 
                245                 250                 255     


Gly Asp Asp Cys Gly Gly Thr Tyr Ser Ser Thr Arg Tyr Ala Gly Thr 
            260                 265                 270         


Cys Asp Thr Asp Gly Cys Asp Phe Asn Pro Tyr Gln Pro Gly Asn His 
        275                 280                 285             


Ser Phe Tyr Gly Pro Gly Lys Ile Val Asp Thr Ser Ser Lys Phe Thr 
    290                 295                 300                 


Val Val Thr Gln Phe Ile Thr Asp Asp Gly Thr Pro Ser Gly Thr Leu 
305                 310                 315                 320 


Thr Glu Ile Lys Arg Phe Tyr Val Gln Asn Gly Lys Val Ile Pro Gln 
                325                 330                 335     


Ser Glu Ser Thr Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu 
            340                 345                 350         


Tyr Cys Thr Ala Gln Lys Ala Ala Phe Asp Asn Thr Gly Phe Phe Thr 
        355                 360                 365             


His Gly Gly Leu Gln Lys Ile Ser Gln Ala Leu Ala Gln Gly Met Val 
    370                 375                 380                 


Leu Val Met Ser Leu Trp Asp Asp His Ala Ala Asn Met Leu Trp Leu 
385                 390                 395                 400 


Asp Ser Thr Tyr Pro Thr Asp Ala Asp Pro Asp Thr Pro Gly Val Ala 
                405                 410                 415     


Arg Gly Thr Cys Pro Thr Thr Ser Gly Val Pro Ala Asp Val Glu Ser 
            420                 425                 430         


Gln Asn Pro Asn Ser Tyr Val Ile Tyr Ser Asn Ile Lys Val Gly Pro 
        435                 440                 445             


Ile Asn Ser Thr Phe Thr Ala Asn 
    450                 455     


<210>  13
<211>  455
<212>  PRT
<213>  Talaromyces emersonii cbh1

<400>  13

Met Leu Arg Arg Ala Leu Leu Leu Ser Ser Ser Ala Ile Leu Ala Val 
1               5                   10                  15      


Lys Ala Gln Gln Ala Gly Thr Ala Thr Ala Glu Asn His Pro Pro Leu 
            20                  25                  30          


Thr Trp Gln Glu Cys Thr Ala Pro Gly Ser Cys Thr Thr Gln Asn Gly 
        35                  40                  45              


Ala Val Val Leu Asp Ala Asn Trp Arg Trp Val His Asp Val Asn Gly 
    50                  55                  60                  


Tyr Thr Asn Cys Tyr Thr Gly Asn Thr Trp Asp Pro Thr Tyr Cys Pro 
65                  70                  75                  80  


Asp Asp Glu Thr Cys Ala Gln Asn Cys Ala Leu Asp Gly Ala Asp Tyr 
                85                  90                  95      


Glu Gly Thr Tyr Gly Val Thr Ser Ser Gly Ser Ser Leu Lys Leu Asn 
            100                 105                 110         


Phe Val Thr Gly Ser Asn Val Gly Ser Arg Leu Tyr Leu Leu Gln Asp 
        115                 120                 125             


Asp Ser Thr Tyr Gln Ile Phe Lys Leu Leu Asn Arg Glu Phe Ser Phe 
    130                 135                 140                 


Asp Val Asp Val Ser Asn Leu Pro Cys Gly Leu Asn Gly Ala Leu Tyr 
145                 150                 155                 160 


Phe Val Ala Met Asp Ala Asp Gly Gly Val Ser Lys Tyr Pro Asn Asn 
                165                 170                 175     


Lys Ala Gly Ala Lys Tyr Gly Thr Gly Tyr Cys Asp Ser Gln Cys Pro 
            180                 185                 190         


Arg Asp Leu Lys Phe Ile Asp Gly Glu Ala Asn Val Glu Gly Trp Gln 
        195                 200                 205             


Pro Ser Ser Asn Asn Ala Asn Thr Gly Ile Gly Asp His Gly Ser Cys 
    210                 215                 220                 


Cys Ala Glu Met Asp Val Trp Glu Ala Asn Ser Ile Ser Asn Ala Val 
225                 230                 235                 240 


Thr Pro His Pro Cys Asp Thr Pro Gly Gln Thr Met Cys Ser Gly Asp 
                245                 250                 255     


Asp Cys Gly Gly Thr Tyr Ser Asn Asp Arg Tyr Ala Gly Thr Cys Asp 
            260                 265                 270         


Pro Asp Gly Cys Asp Phe Asn Pro Tyr Arg Met Gly Asn Thr Ser Phe 
        275                 280                 285             


Tyr Gly Pro Gly Lys Ile Ile Asp Thr Thr Lys Pro Phe Thr Val Val 
    290                 295                 300                 


Thr Gln Phe Leu Thr Asp Asp Gly Thr Asp Thr Gly Thr Leu Ser Glu 
305                 310                 315                 320 


Ile Lys Arg Phe Tyr Ile Gln Asn Ser Asn Val Ile Pro Gln Pro Asn 
                325                 330                 335     


Ser Asp Ile Ser Gly Val Thr Gly Asn Ser Ile Thr Thr Glu Phe Cys 
            340                 345                 350         


Thr Ala Gln Lys Gln Ala Phe Gly Asp Thr Asp Asp Phe Ser Gln His 
        355                 360                 365             


Gly Gly Leu Ala Lys Met Gly Ala Ala Met Gln Gln Gly Met Val Leu 
    370                 375                 380                 


Val Met Ser Leu Trp Asp Asp Tyr Ala Ala Gln Met Leu Trp Leu Asp 
385                 390                 395                 400 


Ser Asp Tyr Pro Thr Asp Ala Asp Pro Thr Thr Pro Gly Ile Ala Arg 
                405                 410                 415     


Gly Thr Cys Pro Thr Asp Ser Gly Val Pro Ser Asp Val Glu Ser Gln 
            420                 425                 430         


Ser Pro Asn Ser Tyr Val Thr Tyr Ser Asn Ile Lys Phe Gly Pro Ile 
        435                 440                 445             


Asn Ser Thr Phe Thr Ala Ser 
    450                 455 


<210>  14
<211>  459
<212>  PRT
<213>  Talaromyces emersonii cbh2

<400>  14

Met Arg Asn Leu Leu Ala Leu Ala Pro Ala Ala Leu Leu Val Gly Ala 
1               5                   10                  15      


Ala Glu Ala Gln Gln Ser Leu Trp Gly Gln Cys Gly Gly Ser Ser Trp 
            20                  25                  30          


Thr Gly Ala Thr Ser Cys Ala Ala Gly Ala Thr Cys Ser Thr Ile Asn 
        35                  40                  45              


Pro Tyr Tyr Ala Gln Cys Val Pro Ala Thr Ala Thr Pro Thr Thr Leu 
    50                  55                  60                  


Thr Thr Thr Thr Lys Pro Thr Ser Thr Gly Gly Ala Ala Pro Thr Thr 
65                  70                  75                  80  


Pro Pro Pro Thr Thr Thr Gly Thr Thr Thr Ser Pro Val Val Thr Arg 
                85                  90                  95      


Pro Ala Ser Ala Ser Gly Asn Pro Phe Glu Gly Tyr Gln Leu Tyr Ala 
            100                 105                 110         


Asn Pro Tyr Tyr Ala Ser Glu Val Ile Ser Leu Ala Ile Pro Ser Leu 
        115                 120                 125             


Ser Ser Glu Leu Val Pro Lys Ala Ser Glu Val Ala Lys Val Pro Ser 
    130                 135                 140                 


Phe Val Trp Leu Asp Gln Ala Ala Lys Val Pro Ser Met Gly Asp Tyr 
145                 150                 155                 160 


Leu Lys Asp Ile Gln Ser Gln Asn Ala Ala Gly Ala Asp Pro Pro Ile 
                165                 170                 175     


Ala Gly Ile Phe Val Val Tyr Asp Leu Pro Asp Arg Asp Cys Ala Ala 
            180                 185                 190         


Ala Ala Ser Asn Gly Glu Phe Ser Ile Ala Asn Asn Gly Val Ala Leu 
        195                 200                 205             


Tyr Lys Gln Tyr Ile Asp Ser Ile Arg Glu Gln Leu Thr Thr Tyr Ser 
    210                 215                 220                 


Asp Val His Thr Ile Leu Val Ile Glu Pro Asp Ser Leu Ala Asn Val 
225                 230                 235                 240 


Val Thr Asn Leu Asn Val Pro Lys Cys Ala Asn Ala Gln Asp Ala Tyr 
                245                 250                 255     


Leu Glu Cys Ile Asn Tyr Ala Ile Thr Gln Leu Asp Leu Pro Asn Val 
            260                 265                 270         


Ala Met Tyr Leu Asp Ala Gly His Ala Gly Trp Leu Gly Trp Gln Ala 
        275                 280                 285             


Asn Leu Ala Pro Ala Ala Gln Leu Phe Ala Ser Val Tyr Lys Asn Ala 
    290                 295                 300                 


Ser Ser Pro Ala Ser Val Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr 
305                 310                 315                 320 


Asn Ala Trp Ser Ile Ser Arg Cys Pro Ser Tyr Thr Gln Gly Asp Ala 
                325                 330                 335     


Asn Cys Asp Glu Glu Asp Tyr Val Asn Ala Leu Gly Pro Leu Phe Gln 
            340                 345                 350         


Glu Gln Gly Phe Pro Ala Tyr Phe Ile Ile Asp Thr Ser Arg Asn Gly 
        355                 360                 365             


Val Arg Pro Thr Lys Gln Ser Gln Trp Gly Asp Trp Cys Asn Val Ile 
    370                 375                 380                 


Gly Thr Gly Phe Gly Val Arg Pro Thr Thr Asp Thr Gly Asn Pro Leu 
385                 390                 395                 400 


Glu Asp Ala Phe Val Trp Val Lys Pro Gly Gly Glu Ser Asp Gly Thr 
                405                 410                 415     


Ser Asn Thr Thr Ser Pro Arg Tyr Asp Tyr His Cys Gly Leu Ser Asp 
            420                 425                 430         


Ala Leu Gln Pro Ala Pro Glu Ala Gly Thr Trp Phe Gln Ala Tyr Phe 
        435                 440                 445             


Glu Gln Leu Leu Thr Asn Ala Asn Pro Leu Phe 
    450                 455                 


<210>  15
<211>  1608
<212>  DNA
<213>  Trichoderma reesei cbh1

<400>  15
atggtctcct tcacctccct gctggccggc gttgccgcta tctctggtgt cctagcagcc     60

cctgccgcag aagttgaacc tgtcgcagtt gagaaacgtg aggccgaagc agaagctcaa    120

tccgcttgta ccctacaatc cgaaactcac ccaccattga cctggcaaaa gtgttctagc    180

ggtggaactt gtactcaaca aactggttct gttgttatcg acgctaactg gagatggaca    240

cacgccacta actcttctac caactgttac gacggtaaca cttggtcttc cactttatgt    300

ccagataacg aaacttgtgc taagaattgc tgtttggacg gtgccgccta cgcttctacc    360

tacggtgtta ccacctccgg taactccttg tctattggtt tcgtcactca atccgctcaa    420

aagaacgttg gtgctagatt gtacttgatg gcttctgaca ctacttatca agaatttact    480

ttgttgggta acgaattttc tttcgatgtt gacgtttccc aattgccatg tggcttgaac    540

ggtgctttgt actttgtctc tatggatgct gacggtggtg tttctaagta cccaactaac    600

actgccggtg ctaagtacgg tactggttac tgtgattctc aatgtccacg tgacttgaag    660

ttcattaacg gtcaagccaa cgtcgaaggt tgggaaccat cctccaacaa cgctaacacc    720

ggtatcggtg gtcacggttc ctgttgttcc gaaatggaca tctgggaagc taacagtatt    780

tctgaagctt tgacaccaca cccatgcacc actgtcggtc aagaaatttg tgaaggtgat    840

ggatgtggtg gaacctactc tgataacaga tacggtggta cttgtgaccc agacggttgt    900

gactggaacc catacagatt gggtaacact tctttctatg gtccaggttc ttctttcacc    960

ttggatacca ccaagaagtt gactgttgtt acccaattcg aaacttctgg tgctatcaac   1020

agatactacg ttcaaaacgg tgtcaccttc caacaaccaa acgctgaatt gggttcttac   1080

tctggtaatg aattgaacga cgactactgt accgctgaag aagctgaatt tggtggttcc   1140

tctttctccg acaagggtgg tttgacccaa ttcaagaagg ctacctccgg tggtatggtt   1200

ttggttatgt ccttgtggga tgattactac gcaaacatgt tatggttaga cagtacttac   1260

ccaactaacg aaacctcctc tactccaggt gctgtcagag gttcctgttc tacctcttct   1320

ggtgttccag ctcaagttga atctcaatct ccaaacgcta aggtcacttt ctccaacatc   1380

aagttcggtc caatcggttc cactggtaat ccatctggtg gaaaccctcc aggtggtaac   1440

agaggtacta ccactactcg taggccagct actacaactg gttcttcccc aggcccaacc   1500

caatcccact acggtcaatg tggtggtatc ggttactctg gtccaaccgt ctgtgcttct   1560

ggtactacct gtcaagtttt aaacccatac tactctcaat gtttgtaa                1608


<210>  16
<211>  1479
<212>  DNA
<213>  Trichoderma reesei cbh2

<400>  16
atggtctcct tcacctccct gctggccggc gttgccgcta tctctggtgt cctagcagcc     60

cctgccgcag aagttgaacc tgtcgcagtt gagaaacgtg aggccgaagc agaagctgtc    120

ccattagaag aaagacaagc ctgctcctct gtttggggtc aatgtggtgg tcaaaactgg    180

tctggtccaa cttgttgtgc ttccggttct acctgtgttt actccaacga ctactattcc    240

caatgtttgc caggtgctgc ttcctcttcc tcttcaacta gagctgcttc tacaacttct    300

agggtctccc caaccacttc cagatcctct tctgctactc caccaccagg ttctactacc    360

actagagttc caccagtcgg ttccggtact gctacttact ctggtaaccc tttcgtcggt    420

gttactccat gggctaacgc ttactacgct tctgaagttt cttctttggc tatcccatct    480

ttgactggtg ctatggctac cgctgctgct gctgtcgcca aagttccatc cttcatgtgg    540

ttggacacct tggacaaaac tccattaatg gaacaaacct tggcagacat aaggactgct    600

aacaagaacg gcggtaacta cgctggtcaa tttgttgtgt acgacttgcc agacagagac    660

tgtgctgctt tggcttccaa cggtgaatac tccatcgctg acggtggtgt cgccaagtac    720

aagaactaca ttgataccat tagacaaatc gttgtcgaat actctgacat cagaaccttg    780

ttagtcatcg aaccagattc tttagccaat ttagtcacca acttgggtac tccaaagtgt    840

gctaacgctc aatctgccta cttagaatgt atcaattatg cagttaccca attgaacttg    900

ccaaacgttg ctatgtactt ggacgctggt cacgccggtt ggttgggttg gccagctaac    960

caagacccag ccgctcaatt attcgccaac gtttacaaga atgcctcttc tcctagagcc   1020

ttgcgtggtt tggctactaa cgtcgctaac tacaacggtt ggaacatcac ttctccacca   1080

tcttacaccc aaggtaacgc tgtttacaac gaaaagttgt acattcacgc tatcggtcca   1140

ttattggcta accatggttg gtctaacgcc ttcttcatca ccgaccaagg tagatccggt   1200

aaacaaccaa ctggtcaaca acaatggggt gattggtgta acgtcatcgg tactggtttc   1260

ggtatcagac catccgctaa cactggtgat tccttgttgg attccttcgt ctgggttaag   1320

ccaggtggtg aatgtgatgg cacctctgat tcctctgctc caagattcga ttcccactgc   1380

gccttgccag acgctttgca accagcccca caagctggtg catggttcca agcttacttt   1440

gtccaattgt tgaccaacgc taacccatct ttcttgtaa                          1479


<210>  17
<211>  535
<212>  PRT
<213>  Trichoderma reesei cbh1

<400>  17

Met Val Ser Phe Thr Ser Leu Leu Ala Gly Val Ala Ala Ile Ser Gly 
1               5                   10                  15      


Val Leu Ala Ala Pro Ala Ala Glu Val Glu Pro Val Ala Val Glu Lys 
            20                  25                  30          


Arg Glu Ala Glu Ala Glu Ala Gln Ser Ala Cys Thr Leu Gln Ser Glu 
        35                  40                  45              


Thr His Pro Pro Leu Thr Trp Gln Lys Cys Ser Ser Gly Gly Thr Cys 
    50                  55                  60                  


Thr Gln Gln Thr Gly Ser Val Val Ile Asp Ala Asn Trp Arg Trp Thr 
65                  70                  75                  80  


His Ala Thr Asn Ser Ser Thr Asn Cys Tyr Asp Gly Asn Thr Trp Ser 
                85                  90                  95      


Ser Thr Leu Cys Pro Asp Asn Glu Thr Cys Ala Lys Asn Cys Cys Leu 
            100                 105                 110         


Asp Gly Ala Ala Tyr Ala Ser Thr Tyr Gly Val Thr Thr Ser Gly Asn 
        115                 120                 125             


Ser Leu Ser Ile Gly Phe Val Thr Gln Ser Ala Gln Lys Asn Val Gly 
    130                 135                 140                 


Ala Arg Leu Tyr Leu Met Ala Ser Asp Thr Thr Tyr Gln Glu Phe Thr 
145                 150                 155                 160 


Leu Leu Gly Asn Glu Phe Ser Phe Asp Val Asp Val Ser Gln Leu Pro 
                165                 170                 175     


Cys Gly Leu Asn Gly Ala Leu Tyr Phe Val Ser Met Asp Ala Asp Gly 
            180                 185                 190         


Gly Val Ser Lys Tyr Pro Thr Asn Thr Ala Gly Ala Lys Tyr Gly Thr 
        195                 200                 205             


Gly Tyr Cys Asp Ser Gln Cys Pro Arg Asp Leu Lys Phe Ile Asn Gly 
    210                 215                 220                 


Gln Ala Asn Val Glu Gly Trp Glu Pro Ser Ser Asn Asn Ala Asn Thr 
225                 230                 235                 240 


Gly Ile Gly Gly His Gly Ser Cys Cys Ser Glu Met Asp Ile Trp Glu 
                245                 250                 255     


Ala Asn Ser Ile Ser Glu Ala Leu Thr Pro His Pro Cys Thr Thr Val 
            260                 265                 270         


Gly Gln Glu Ile Cys Glu Gly Asp Gly Cys Gly Gly Thr Tyr Ser Asp 
        275                 280                 285             


Asn Arg Tyr Gly Gly Thr Cys Asp Pro Asp Gly Cys Asp Trp Asn Pro 
    290                 295                 300                 


Tyr Arg Leu Gly Asn Thr Ser Phe Tyr Gly Pro Gly Ser Ser Phe Thr 
305                 310                 315                 320 


Leu Asp Thr Thr Lys Lys Leu Thr Val Val Thr Gln Phe Glu Thr Ser 
                325                 330                 335     


Gly Ala Ile Asn Arg Tyr Tyr Val Gln Asn Gly Val Thr Phe Gln Gln 
            340                 345                 350         


Pro Asn Ala Glu Leu Gly Ser Tyr Ser Gly Asn Glu Leu Asn Asp Asp 
        355                 360                 365             


Tyr Cys Thr Ala Glu Glu Ala Glu Phe Gly Gly Ser Ser Phe Ser Asp 
    370                 375                 380                 


Lys Gly Gly Leu Thr Gln Phe Lys Lys Ala Thr Ser Gly Gly Met Val 
385                 390                 395                 400 


Leu Val Met Ser Leu Trp Asp Asp Tyr Tyr Ala Asn Met Leu Trp Leu 
                405                 410                 415     


Asp Ser Thr Tyr Pro Thr Asn Glu Thr Ser Ser Thr Pro Gly Ala Val 
            420                 425                 430         


Arg Gly Ser Cys Ser Thr Ser Ser Gly Val Pro Ala Gln Val Glu Ser 
        435                 440                 445             


Gln Ser Pro Asn Ala Lys Val Thr Phe Ser Asn Ile Lys Phe Gly Pro 
    450                 455                 460                 


Ile Gly Ser Thr Gly Asn Pro Ser Gly Gly Asn Pro Pro Gly Gly Asn 
465                 470                 475                 480 


Arg Gly Thr Thr Thr Thr Arg Arg Pro Ala Thr Thr Thr Gly Ser Ser 
                485                 490                 495     


Pro Gly Pro Thr Gln Ser His Tyr Gly Gln Cys Gly Gly Ile Gly Tyr 
            500                 505                 510         


Ser Gly Pro Thr Val Cys Ala Ser Gly Thr Thr Cys Gln Val Leu Asn 
        515                 520                 525             


Pro Tyr Tyr Ser Gln Cys Leu 
    530                 535 


<210>  18
<211>  471
<212>  PRT
<213>  Trichoderma reesei cbh2

<400>  18

Met Ile Val Gly Ile Leu Thr Thr Leu Ala Thr Leu Ala Thr Leu Ala 
1               5                   10                  15      


Ala Ser Val Pro Leu Glu Glu Arg Gln Ala Cys Ser Ser Val Trp Gly 
            20                  25                  30          


Gln Cys Gly Gly Gln Asn Trp Ser Gly Pro Thr Cys Cys Ala Ser Gly 
        35                  40                  45              


Ser Thr Cys Val Tyr Ser Asn Asp Tyr Tyr Ser Gln Cys Leu Pro Gly 
    50                  55                  60                  


Ala Ala Ser Ser Ser Ser Ser Thr Arg Ala Ala Ser Thr Thr Ser Arg 
65                  70                  75                  80  


Val Ser Pro Thr Thr Ser Arg Ser Ser Ser Ala Thr Pro Pro Pro Gly 
                85                  90                  95      


Ser Thr Thr Thr Arg Val Pro Pro Val Gly Ser Gly Thr Ala Thr Tyr 
            100                 105                 110         


Ser Gly Asn Pro Phe Val Gly Val Thr Pro Trp Ala Asn Ala Tyr Tyr 
        115                 120                 125             


Ala Ser Glu Val Ser Ser Leu Ala Ile Pro Ser Leu Thr Gly Ala Met 
    130                 135                 140                 


Ala Thr Ala Ala Ala Ala Val Ala Lys Val Pro Ser Phe Met Trp Leu 
145                 150                 155                 160 


Asp Thr Leu Asp Lys Thr Pro Leu Met Glu Gln Thr Leu Ala Asp Ile 
                165                 170                 175     


Arg Thr Ala Asn Lys Asn Gly Gly Asn Tyr Ala Gly Gln Phe Val Val 
            180                 185                 190         


Tyr Asp Leu Pro Asp Arg Asp Cys Ala Ala Leu Ala Ser Asn Gly Glu 
        195                 200                 205             


Tyr Ser Ile Ala Asp Gly Gly Val Ala Lys Tyr Lys Asn Tyr Ile Asp 
    210                 215                 220                 


Thr Ile Arg Gln Ile Val Val Glu Tyr Ser Asp Ile Arg Thr Leu Leu 
225                 230                 235                 240 


Val Ile Glu Pro Asp Ser Leu Ala Asn Leu Val Thr Asn Leu Gly Thr 
                245                 250                 255     


Pro Lys Cys Ala Asn Ala Gln Ser Ala Tyr Leu Glu Cys Ile Asn Tyr 
            260                 265                 270         


Ala Val Thr Gln Leu Asn Leu Pro Asn Val Ala Met Tyr Leu Asp Ala 
        275                 280                 285             


Gly His Ala Gly Trp Leu Gly Trp Pro Ala Asn Gln Asp Pro Ala Ala 
    290                 295                 300                 


Gln Leu Phe Ala Asn Val Tyr Lys Asn Ala Ser Ser Pro Arg Ala Leu 
305                 310                 315                 320 


Arg Gly Leu Ala Thr Asn Val Ala Asn Tyr Asn Gly Trp Asn Ile Thr 
                325                 330                 335     


Ser Pro Pro Ser Tyr Thr Gln Gly Asn Ala Val Tyr Asn Glu Lys Leu 
            340                 345                 350         


Tyr Ile His Ala Ile Gly Arg Leu Leu Ala Asn His Gly Trp Ser Asn 
        355                 360                 365             


Ala Phe Phe Ile Thr Asp Gln Gly Arg Ser Gly Lys Gln Pro Thr Gly 
    370                 375                 380                 


Gln Gln Gln Trp Gly Asp Trp Cys Asn Val Ile Gly Thr Gly Phe Gly 
385                 390                 395                 400 


Ile Arg Pro Ser Ala Asn Thr Gly Asp Ser Leu Leu Asp Ser Phe Val 
                405                 410                 415     


Trp Val Lys Pro Gly Gly Glu Cys Asp Gly Thr Ser Asp Ser Ser Ala 
            420                 425                 430         


Pro Arg Phe Asp Ser His Cys Ala Leu Pro Asp Ala Leu Gln Pro Ala 
        435                 440                 445             


Ala Gln Ala Gly Ala Trp Phe Gln Ala Tyr Phe Val Gln Leu Leu Thr 
    450                 455                 460                 


Asn Ala Asn Pro Ser Phe Leu 
465                 470     


<210>  19
<211>  148
<212>  DNA
<213>  Artificial Sequence

<220>
<223>  Synthetic xyn2 secretion signal

<400>  19
gaattcttaa ttaaaaacaa aatggtctcc ttcacctccc tgctggccgg cgttgccgct     60

atctctggtg tcctagcagc ccctgccgca gaagttgaac ctgtcgcagt tgagaaacgt    120

gaggccgaag cagaagctcc cgggactc                                       148


<210>  20
<211>  39
<212>  PRT
<213>  Artificial Sequence

<220>
<223>  Synthetic xyn2 secretion signal

<400>  20

Met Val Ser Phe Thr Ser Leu Leu Ala Gly Val Ala Ala Ile Ser Gly 
1               5                   10                  15      


Val Leu Ala Ala Pro Ala Ala Glu Val Glu Pro Val Ala Val Glu Lys 
            20                  25                  30          


Arg Glu Ala Glu Ala Glu Ala 
        35                  


